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ANALYSIS OF NUCLEOTIDE POLYMORPHISMS AT A SITE 

Field of the Invention 

The present invention relates to methods of determining the identity of a polymorphic nucleotide in a target 
5 sequence having at least two variants such as a single nucleotide polymorphism, or SNP. The methods of the present 
invention utilize primers having sequences complementary to the region upstream of the position being analyzed. 
Extension of primers hybridized to target sites is carried out in the absence of a deoxyribonucleoside triphosphate 
(dNTP) or ribonucleoside triphosphate (rNTP) complementary to one of the polymorphic nucleotides. Differences in 
length between the primers and any extension products reveal the identity of the nucleotide present at the 
10 polymorphic site. 

Background of the Invention 
DNA polymorphism can be due to differences in sequence or in length of a genomic region. Approximately 
80% of human DNA polymorphisms are sequence polymorphisms, while only about 20% are length polymorphisms. 
About 90% of sequence polymorphisms are single nucleotide polymorphisms (SNPs). SNPs are genetic variations that 

15 arise from differences in the identity of a single nucleotide in a nucleic acid sequence* ghring rise to two variants 
(sometimes called alleles) of that site. Sites having three polymorphic nucleotides have also been detected. SNPs 
appear to be the most widely distributed genetic markers in the human genome, occurring approximately every 
kilobase. Since SNPs represent the most common type of DNA sequence variation, the ability to discriminate between 
variants of these genetic markers is a very important tool in genetic research. 

20 Many inherited diseases are the result of single point mutations at SNP sites. In some cases, the single 

point mutation causing nucleotide substitution in a protein-encoding gene is sufficient to actually cause the disease, as 
in sickle cell anemia and hemophilia. For diseases influenced by a large number of genes, including diabetes, heart 
disease, various cancers, and certain psychiatric disorders, SNPs are studied as markers to aid scientists in creating 
detailed maps of genetic variation to help find disease-linked genes. 

25 SNP markers can be used to identify genes involved in disease or associated with any detectable phenotype 

by identifying the variant bases of one or more SNPs that correlate with the presence, absence, or degree of severity 
of the condition. DNA samples are isolated from individuals with and without the disease, and the identity of the 
polymorphic bases of one or more SNPs from each population are determined. The variants having a statistical 
association with the disease or phenotype are identified. Thereafter, samples may be taken from individuals and the 

30 variant bases of one or more SNPs associated with a disease or phenotype can be identified to determine whether the 
individuals are likely to develop a particular disease or phenotype, or whether they already suffer from a particular 
disease or possess a particular phenotype. Mapping SNP markers associated with a disease or phenotype to their 
chromosomal locations can identify the genes in which they occur, or indicate nearby genes having a role in the 
development or severity of the disease. By developing a high-density SNP map of the human genome, scientists hope 

35 to be able to pinpoint the .genetic origins of diseases, the genetic differences that predispose some individuals to 
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disease and underlie variations in individual responses to treatment and, potentially, to predict the most appropriate 

drugs to treat disease in individuals of a given genetic makeup. 

Both the high frequency and wide distribution of SNPs in the human genome makes them a valuable source 

of biallelic markers for identity testing, genome mapping, and medical diagnostics. SNPs are densely spaced in the 
5 human genome, with an estimated number of more than 10^ sites scattered along the 3x10^ base pairs of the human 

genome. Because SNPs occur at a greater frequency and with more uniform distribution than other classes of 

polymorphisms such as variable number of tandem repeat (VNTR) polymorphisms or restriction fragment length 

polymorphisms (RFLPs), there is a greater probability that SNP markers will be found in close proximity to a genetic 

locus of interest. SNPs are also preferred as markers because they are mutationaliy more stable than VNTRs, which 
10 have a high mutation rate. In addition, genome analysis using VNTRs and RFLPs is highly dependent on the method 

used to detect the polymorphism, while new SNPs can easily be detected by sequencing- either random sequencing to 

detect new SNPs or targeted sequencing to analyze known SNPs. 

The different forms of a characterized SNP are easy to distinguish and can therefore be used on a routine 

basis for genetic typing based on polymorphisms within and between individuals. SNPs correspond to a locus where 
15 the sequence differs by a single nucleotide and has only two alleles, making SNPs suitable for highly parallel detection 

and automated scoring. These features offer the possibility of developing rapid, high throughput genotyping using 

SNP analysis. 

At present, SNPs can be characterized using any of a variety of methods. These methods include direct or 
indirect sequencing of the site, oligonucleotide ligation assays (OLAs), ligase/polymerase analysis, use of allele-specific 

20 hybridization probes, use of dideoxyribonucleoside triphosphates (ddNTPs) for extension in solution or on solid phase, 
or use of restriction enzymes to map SNPs. A significant disadvantage of the oligonucleotide ligation assay (OLA) is 
that this method requires each possible variant of the SNP to be analyzed using a separate set of oligonucleotides for 
each nucleotide. The main drawback of OLA is that ligation is not a highly discriminating process, such that non- 
specific signals can occur with an unacceptably high frequency. Techniques such as sequencing, ligase/polymerase 

25 analysis, or restriction enzyme mapping are laborious, ttme-consuming> and often quite expensive for large-scale 
analysis. Methods such as extension in solution or on solid phase, or ligase/polymerase analysis, rely on incorporation 
of expensive ddNTPs or labeled dNTPs between bases at a polymorphic site. Since the signal is proportional to the 
number of ddNTPs or labeled dNTPs incorporated, these methods are often not sensitive enough to be used for routine 
analysis. For extension on solid phase, primers must first be immobilized to a solid support. Use of a solid support 

30 often interferes with hybridization of a primer to the target sequence. 

A rapid, accurate, and cost-effecthre method is needed to meet demands for automated high-throughput 
analysis of SNPs. 

Summary of the Invention 

The present invention provides a simple and effective method for determining the identity of the nucleotide 
35 present at a polymorphic site. 
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The invention involves detection of reaction products of a primer that hybridizes upstream of the 
polymorphic site. ONA polymerase or RNA polymerase is used to extend the primer in the absence of a dNTP or rNTP 
complementary to one of the variants at the polymorphic site, for example a SNP. 

In detail, the invention provides a method for determining the identity ef a nucleotide at a polymorphic site, 
5 said method comprising the following steps: 

1) Oligonucleotide hybridization: Oligonucleotides having a nucleotide sequence complementiry to that of a 
target molecule known to contain a SNP are hybridized in a manner such that the 3' terminus of the hybridized 
oligonucleotide is upstream of the preselected site. 

2) Polymerase extension: The hybridized primer is incubated with DNA or RNA polymerase and one or more 
10 dNTPs or rNTPs, under conditions sufficient to permit template dependent polymerase incorporation of the dNTP or 

rNTP to the 3' terminus of the hybridized oligonucleotide. Extension reactions are performed in the absence of a dNTP 
or rNTP complementary to one of the variants. Extension of the primer to the polymorphic position depends upon 
whether the reaction mixture contains the dNTP or rNTP complementary to the variant present at the preselected 
site. Preferably, the dNTPs or rNTPs included in these reactions are not detectably labelled or modified with a moeity 
15 recognized by a detectably labelled agent. Thus, preferably the dNTPs or rNTPs used in these reactions do not include 
detectable labels such as radioactive or fluorescent tags, or a moiety capable of binding a detectable label such as 
Iriotin, avidin, streptavidin, a ligand, a hapten, or an antibody. In addition, the dNTPs or rNTPs preferably do not 
include modifications which prevent further extension of the primer, for example as in dideoxyribonucleoside 
triphosphates (ddNTPs). 

20 3) Analysis: The reaction products are analyzed to determine whether the primer has been extended to the 

polymorphic position. Because the extension reactions are performed in the absence of a dNTP or rNTP 
complementary to one of the variants, the reaction product will no*, include the polymorphic base if the reaction 
mixture lacks the complementary nucleotide. Thus, the length of the reaction product depends on whether extension 
proceeded to the polymorphic position. Suitable methods for analysis include any convenient means of determining 
25 the length of the reaction product, including HPLC, capillary electrophoresis, microfluidics technology, or slab gel 
electrophoresis. The primers and extension products may be detected using well known methods, including use of 
intercalators, DNA-binding dyes, or UV light. 

Brief Description of the Drawings 
Figure 1 shows an electropherogram of primer A before the primer extension reaction 
30 Figure 2 shows an electropherogram of primer B before the primer extension reaction. 

Figure 3 shows an electropherogram of reaction products from DNA homozygous for the 6 mutation of exon 
8 of hMSH2. The first peak corresponds to primer A (PA), the second peak corresponds to its extension product 
(PAD, and the third peak corresponds to primer B (PB). 
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Figure 4 shows an electropherogram of reaction products from DNA homozygous for the T mutation of exon 
8 of the hMSH2 gene. The first peak corresponds to primer A (PAL and the second group of peaks corresponds to 
primer B (PB) and its extension products (PB1. PB2, PBS). 

Figure 5 shows an electropherogram of reactions from DNA heterozygous for both the G and T mutations of 
5 exon 8 of the hMSH2 gene. 

Detailed Description of the Preferred Embodiment 
The invention provides a technique for determining the identity of a nucleotide at a polymorphic site having 
at least two variants by performing one or more primer extension reactions and detecting reaction products. The 
variants at the polymorphic site may be associated with a detectable trait such as a disease. The first step 

10 encompasses obtaining a DNA sample suitable for use in extension reactions. Techniques for DNA isolation from any 
organism are well known and are routine practice for one skilled in the art. 

In one embodiment, a primer is hybridized to a target sequence such that its 3' end is upstream of a 
polymorphic site having two variants, an extension mixture lacking a dNTP or rNTP complementary to one of the 
variants is added, and a primer extension reaction is carried out with DNA polymerase or RNA polymerase. The primer 

15 may hybridize such that its 3. end is about 1« 2^3, 4, 5, 10, 15, 20, or more than 20 nucleotides upstream of the 
polymorphic site. However, it will be appreciated that the primer may hybridize at any location consistent with its 
intended use. Two parallel reactions are performed, each lacking one of the dNTPs or rNTPs complementary to one 
variant. As an illustrative example, a primer hybridizes with its terminus upstream of a target sequence containing a 
SNP in which one variant of the polymorphic base is A and the other variant of the polymorphic base is T. The 

20 sequence between the 3' terminus of the primer and the polymorphic site consists of C6GC; thus, the sequence 
. downstream of the primer for the A variant would be CGGCA, and the sequence downstream of the primer for the T 
variant would be CGGCT. If the primer extension reaction were carried out in the presence of dCTP, dGTP, and dATP, 
the primer would be extended by 5 nucleotides or more in reactions performed on DNA samples containing the T 
variant and by only 4 nucleotides in reactions performed on DNA sarnples containing the A variant. In the presence of 

25 dCTP, dGTP, and dTTP, the primer would be extended by only 4 nucleotides for the T variant, and by 5 nucleotides or 
more for the A variant. 

In preferred embodiments of the invention, the primers hybridize to the target sequence such that their 3' 
ends are immediately adjacent to the polymorphic site, and the reaction mixture contains a single dNTP or rNTP 
complementary to one of the variants, such that the 3 ' terminus of a hybridized oligonucleotide is extended by a 
30 single nucleotide if the reaction mixture contains a dNTP or rNTP complementary to the polymorphic base in the 
sample. If the reaction mixture does not include a dNTP or rNTP complementary to the polymorphic base in the 
sample, the primer will not be extended. In some embodiments, when the target molecule has a homosequence such 
that the nucleotides following the polymorphic site are the same as the nucleotide at the polymorphic site, the 
hybridized oligonucleotide is extended by several nucleotides. 
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The invention encompasses embodiments wherein, after a DNA sample is obtained, an amplification reaction 
is performed using primers on each side of the polymorphic site in order to dramatically increase the quantity of 
material containing the polymorphic site being analyzed. The amplification products are then used in the methods of 
the present invention. Highly preferred amplification methods include the use of polymerase chain reaction (PGR) to 
5 amplify DNA sequences, followed by a purification step to remove unincorporated dNTPs. Such a purification step 
could include the use of column chromatography, for example chromatography over Sephadex media, reaction with 
exonuclease and alkaline phosphatase, or any other method known to one skilled in the art. In a highly preferred 
embodiment, after removal of unincorporated dNTPs with exonuclease and alkaline phosphatase, the PGR product 
containing the polymorphic site is combined with oligonucleotide primer. An extension mixture comprises 20 mM Tris, 

10 10 mM MgCljr 20 mM NaCt, 500 M of one dNTP and 1 unit of Sequenase (Taq polymerase). Extension reactions are 
performed for 10-15 minutes and 25-30 extension cycles. 

After the extension reaction is performed, the reaction products are detected to determine whether the 
primer has been extended to the polymorphic site. Preferably, reaction products are analyzed by HPLC or capillary 
electrophoresis, using DNA-binding dyes, RNA binding dyes, or intercaiators, e.g., oligogreen, phenanthridines, or 

15 acridine, to enhance detection. Preferably, microfluidics technology can be used to analyze the reaction products. In 
another embodiment, reaction products are analyzed by separation in slab gel or capillary electrophoresis and detected 
by the use of a DNA-binding dye. Suitable DNA binding dyes include ethidium bromide, DAPI, cyanine dyes, silver 
stain, indoles and imidazoles. Preferred commercially available dyes include unsymmetrical cyanine dyes available 
from Molecular Probes, Inc., such as SYBR^ Green I and II, SYBR^" Gold and ViSTRA^ Green. In another preferred 

20 embodiment, the 5' end of the primer is labeled with a radioactive or nonradioactive tag. For example, the 5' end of 
the primer may be labelled with ^^P using nucleotide kinase. Reaction products are detected by autoradiography or 
other suitable methods for detecting radioactively labeled products. 

For DNA isolated from diploid organisms such as humans, three different genotypes are possible: 
homozygous for one allele; homozygous for the other allele, or heterozygous. For analysis of DNA from diploid 

25 organisms, a preferred embodiment of the present invention comprises duplication of the hybridization and extension 
reactions described above, followed by separate or pooled analysis of the reaction products. In one embodiment, the 
same oligonucleotide primer can be used in separate, parallel reactions where each reaction uses a different 
deoxyribonucleoside triphosphate. For example, for a C/A polymorphism, the same primer can be used in two 
reactions, with one reaction mixture containing dGTP and lacking dTTP, and the other reaction containing dTTP and 

30 lacking dGTP. Samples from homozygous individuals will contain an extension product including the polymorphic base 
in only one reaction, whereas samples from heterozygous individuals will contain an extension product including the 
polymorphic base in both reactions. 

In another embodiment, the identity of variant nucleotides at a SNP site in DNA from a diploid organism can 
be identified by using two oligonucleotide primers of different length. Preferably, only one primer is used in each 

35 reaction. The products of separate reactions can be analyzed separately, or pooled and analyzed simultaneously. For 
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example, to analyze a C/A polymorphism, a first extension reaction comprises a 20*base primer hybridizing with its 3' 
end immediately adjacent to the polymorphic site and reacted in the presence of dGTP and the absence of dTTP. A 
second reaction comprises a 25-base primer also hybridizing with its 3' end immediately adjacent to the polymorphic 
site and reacted in the present of dTTP and the absence of dGTP. Separate extensions are performed on aliquots of 

5 the same DNA sample, one reaction containmg the 20-base primer and dBTP, but lacking dTTP, and the other 
containing the 25-base primer and dTTP, but lacking dGTP. If the organism from which the DNA was extracted is 
homozygous for the C variant, then the reaction containing dGTP but lacking dTTP would produce an extension 
product including the polymorphic base, and the reaction containing dTTP but lacking dGTP would produce no 
extension product. If the two reactions are pooled for analysis, the pooled reaction products would include 20-base 

10 primer and 21 -base dGTP extension product from the dGTP reaction, and the 25-base primer from the reaction 
containing dTTP. If the organism from which the DNA was extracted is homozygous for the A variant, pooled reaction 
products would include the 25'base primer and 26*base extension product from the reaction containing dTTP, and only 
the unextended 20-base primer from the reaction containing dGTP. If the organism in question is heterozygous at this 
SNP, then both the extension reactions would give extension products, producing a pooled mixture of the 20-base 

15 primer, 21 -base extension product including the G at the polymorphic base,^ 25-base primer, and 26-base extension 
product including the T at the polymorphic base. 

In another embodiment of the invention, the identity of multiple polymorphic sites in a DNA sample can be 
determined using multiple primers. Multiple polymorphic sites may serve as markers for a genetic disorder or 
phenotype, and subtypes of the disorder or phenotype may be distinguishable by determining the nucleotide occurring 

20 at each of a plurality of known polymorphic sites. Alternatively, the presence of a particular combination of variants 
in several SNPs may indicate that an individual is suffering from a particular disease or has a predisposition for a 
particular disease. In addition, a sample from an individual may be analyzed to determine whether the individual is at 
risk of developing a plurality of diseases or phenotypes, or currently possesses a plurality of diseases or phenotypes. 
Thus, the identities of the polymorphic bases of a SNP or combination of SNPs, each of which is associated with a 

25 different disease or phenotype, are determined in the sample. In one version of this embodiment, the primer for each 
site has a distinct length, such that the primer and any extension products can be distinguished from other primers 
annealing to other sites and their extension products. Preferably, each primer hybridizes with its 3' end immediately 
upstream of the polymorphic site. In a preferred embodiment, a sample of DNA having multiple polymorphic sites is 
mixed with primers of differing lengths, where each primer hybridizes upstream of one polymorphic site. The mixture 

30 is divided, one dNTP or rNTP is added to each reaction, primer extension is carried out, and the reaction products of 
each reaction are analyzed. In a highly preferred embodiment, four separate reactions are carried out using either 
dATP, dTTP, dCTP, or dGTP. Alternatively, each reaction may contain 2 or 3 dNTPs or rNTPs, such that each 
reaction lacks one dNTP or rNTP. A primer will be extended only if the reaction contains the dNTP or rNTP 
complementary to the polymorphic nucleotide at the site present in the sample. In this embodiment, a primer will not 

35 be extended in reactions containing dNTP or rNTP complementary to the other variants of that site which are not 
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present m the sample, nor will the primer be extended when the reaction contains a dNTP or rNTP complementary to 
nucleotides that do not occur at that site. The products of each reaction may be analyzed separately or, depending on 
the particular polymorphic sites being tested, may be pooled for analysis. This invention encompasses embodiments 
using multiple primers to identify the polymorphic nucleotides at a plurality of sites in DNA isolated from a diploid 
5 organism. 

In another preferred embodiment, a sample of DNA containing one or more target sequences having a 
polymorphic site is mixed with primers which hybridize to each of the strands of the of the target sequence such that 
the 3' ends of the primers are upstream of the polymorphic site. Preferably, the primers have different lengths such 
that their extension products can be distinguished. Thus, at each polymorphic site, two primers, each of which 

10 hybridize to one of the strands containing the polymorphic nucleotide, can be used to identify the variant or variants 
present in the target DNA. Preferably, one primer hybridizes to the first strand such that its 3' terminus is adjacent to 
the polymorphic site on the first strand while the second primer hybridizes to the opposite strand, such that its 3' 
terminus is adjacent to the polymorphic site on the opposite strand. However, as discussed herein, the primers may 
also hybridize such that their 3' ends are more than one nucleotide upstream of the polymorphic site. If the primers 

15 are of differing lengths, a single extension reaction can be carried out to determine the variants present at a 
polymorphic site in a diploid DNA sample. For example, if a sample has a T/C polymorphic site on the first DNA strand, 
a first primer having 19 bases hybridizes to the first DNA strand such that its 3' terminus is adjacent to the 
polymorphic site. If the T variant is present on the first strand, this primer will be extended in the presence of dATP. 
A second primer having 22 bases hybridizes to the opposite strand, such that its 3' terminus is adjacent to the 

20 polymorphic site. If the C variant is present on the first DNA strand, then the opposite DNA strand would contain G in 
position opposite the first-strand polymorphic site, such that the second primer hybridized to the opposite DNA strand 
would be extended in the present of dCTP. Thus, a single extension reaction is carried out, where the reaction 
mixture contains both primers and dATP and dCTP but lacks dTTP and dGTP. Analysis of the reaction products 
reveals the variant(s) present at the polymorphic site: a sample homozygous for the T variant would produce 2D-base 

25 long dATP extension products of the 19-base primer that hybridizes to the first DNA strand; a sample homozygous for 
the C variant would produce 23*base-long dCTP extension products of the second primer that hybridizes to the 
opposite DNA strand; a DNA sample heterozygous for both variants would produce extension products of both primers. 
Altemathrely, if the primers are the same length, then two separate extension reactions are carried out. The first 
extension reaction contains one primer and the nucleotide complementary to one of the variants at the polymorphic 

30 site but lacks the nucleotide complementary to the other variant, while the second extension reaction contains the 
other primer and the nucleotide complementary to the other variant but lacks the nucleotide complementary to the 
first variant. If the sample is homozygous for one of the variants, an extension product will be observed in one of the 
extension reactions. However, if the sample is heterozygous, extension products will be observed in both extension 
reactions. 
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Another preferred embodiment comprises a kit or kits containing materials suitable for practidrig the present 
invention. Such kits may include, but are not Kmited to, primers which hybridize upstream of known polymorphic sites 
of interest reagents for carrying out annealing and primer extension reactions, and materials for purification and 
detection of primers and reaction products, including materials to enhance detection of reaction products. Kits may 
5 contain reagents for amplifying the polymorphic site prior to carrying out primer extension reactions. 

It will be appreciated that the present invention can be practiced using isolated RNA, RNA oligonucleotide 
primers, RNA polymerase, and rNTPs. Proper precautions to avoid degradation of the RNA can be routinely practiced 
by one of skill in the art, using well-known methods. 

Example 1 describes the use of the present invention to identify the variants of a human gene known to be 
10 involved in cancer. 

EXAMPLE 1 

The exon 8 mutation (T to G) of the hMSH2 gene was used as a model for analysis of DNA polymorphism. 
The hMSH2 gene codes for a DNA mismatch repair protein, and inherited defects in the gene have been linked with 
the development of hereditary non-potyposis colorectal cancer (NHPCC). This gene is located on chromosome 2p and 

15 is comprised of sixteen exons. Multiple mutations responsible for NHPCC have been identified and are widely 
interspersed throughout the gene. (Liu B., R.E. Parsons, S.R. Hamilton, G.M. Petersen, H.T. Lynch, P. Watson, J. 
Green, A. dela Chapelle, K.W. Kinzler and B. Vogelstein. hMSH2 mutations in hereditary nonpolyposis colorectal cancer 
kindreds. Cancer Research 54: 4590-4594 (1995), the disclosure of which is incorporated herein by reference jn its 
entirety. The sequence of the hMSH2 region containing the exon 8 polymorphic site is provided in the accompanying 

20 sequence listing as SEQ 10 NO: 1. As indicated in the sequence listing, SEQ 10 NO: 1 contains a polymorphic^base 
having two variants, T and G, at position 165. Genomic DNA samples were isolated from individuals having different 
mutations at the same site in exon 8 of hMSH2. A section of exon 8 including the polymorphic site was amplified 
from each genomic sample using the sense direction primer: 5' T6TAAAAC6ACGGCCA6T (SEQ 10 NO: 2) and the 
antisense primer 5' CAGGAAACAGCTATGACC (SEQ 10 NO: 3). Exon 8 was amplified in 20 I reaction volume using 

25 800 nM each of the respective sense and antisense PGR primers, 20 ng of genomic DNA, 200 M of each dNTP, 1 unit 
(U) Taq Gold, 10 mM Tris-HCL pH 8.3, 50 mM KCI. The first cycle was carried out at 94**C for 2 minutes, followed 
by thirty cycles of 94*" for 30 seconds, GO'^C for 1 minute, and 72''C for 30 seconds. 

The PCR product containing the polymorphic site was 234 bases in length, covering the coding region of 
exon 8 of hMSH2 and the intron region immediately flanking it. Unincorporated bases were removed using 

30 exonuclease and shrimp alkaline phosphatase in a reaction mixture containing 5 I of the PCR mixture, 2 U shrimp 
alkaline phosphatase and 2 U exonuclease I, in a 10 I final volume buffered to pH 8.0. The mixture was incubated at 
37''C for 30 minutes, and then at 94^C for 10 minutes to deactivate the enzymes. 

The identity of the polymorphic nucleotide at the exon 8 site was established using two primers of different 
lengths which hybridize with their 3' ends immediately upstream of the polymorphic base: Primer A, a 15-base primer. 
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was used to detect for the 6-mutation in an extension mixture containing only dCTP; Primer B, an IB-base primer, was 
used to detect the T-mutation in an extension mixture containing only dATP. 

Primer A: 5' ACCT6ATCCATATCT (SEQ ID NO: 4) 

Primer B: 5' CATACCT6ATCCATATCT (SEQ ID NO: 5) 
5 The extension reaction mixture contained 10 I purified PCR product containing the axon 8 polymorphic site, 

10 pmol of primer, 2 M dCTP or dATP, 1 U thermosequenase, in a 20 I reaction volume containing 20 mM Tris-HCL 
pH 9.5 and 4 mM MgClj. The mixture was denatured for 1 cycle at SS^'C for 4 minutes, followed by 20 cycles of 
primer extension (55**C for 15 seconds, 72^*0 for 4-10 seconds; 94^C for 10 seconds). The reaction was ethanol- 
precipitated or purified over Sephadex G-50. Purified reaction products were diluted in 10-20 I water or 80% 
10 formamide, and SYBR Green II (cyanine dye. Molecular Probes, Inc.) was added to a final concentration of between 
1:1000 to 1:16000, preferably 1:5000. Reaction products containing the intercalator were injected electrokinetically 
for 1 minute at 3.5 kV, and run at 7-9 kV at room temperature, using a 62 cm capillary tube (coated, 200 m 0.D./75 m 
I.D., Polymicro Technologies, Phoenix, AZ, U.S.A.) A 50 mW, 488 nm argon ion (Ar*) laser (Melies Griot, Carisbad, 
U.S.A.) was used as the excitation source, with the detection window located 40 cm from the injection site. Sieving 
15 polymers used for separation by capillary electrophoresis (CE) include linear polyacrylamide or various cellulose 
derivatives. Other suitable separation media may also be used for CE separation. 

Figure 1 shows an electropherogram of the 15-base oligonucleotide primer A before carrying out an 
extension reaction. Figure 2 shows an electropherogram of the 18-base oligonucleotide primer B before carrying out 
an extension reaction. 

20 Figure 3 shows the pooled reaction products obtained using DNA from an individual homogyzous for the G 

mutation of exon 8 of hMSH2. One extension reaction was carried out using Primer A (SEQ ID NO: 4) and only dCTP, 
while a parallel extension reaction was carried out using Primer B (SEQ ID NO: 5) and only dATP. After the reactions 
were completed, they were pooled and their products analyzed by capillary electrophoresis as described above. The 
products included the 15-base primer A (PA) and its 16-base extension product, PA1. Primer B was not extended 

25 because it was used in an extension reaction lacking the nucleotide complementary to the 6 variant of the 
polymorphic site present in the sample. 

Figure 4 shows the pooled reaction products obtained using DNA from an individual homogyzous for the T 
mutation of exon 8 of hMSH2. One extension reaction was carried out using Primer A (SEQ ID NO: 4) and only dCTP, 
while a parallel extension reaction was carried out using Primer B (SEQ ID NO: 5) and only dATP. After the reactions 

30 were completed, they were pooled and their products analyzed by capillary electrophoresis as described above. Primer 
A (PA) was not extended in the reaction lacking the nucleotide complementary to the T variant of the polymorphic site 
present in the sample. The reaction containing Primer B and dATP gave multiple extension products: unextended 18- 
base Primer B (PB); the lO base extension product formed by incorporating a nucleotide complementary to the T 
variant of the polymorphic site at position 165; and additional extension products resulting from the homosequence of 
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two Ts at positions 164 and 163, immediately downstream of the polymorphic nucleotide, giving rise to 20*base (PB2) 
and 2Vbase (PB3) extension products. 

Figure 5 shows the pooled reaction products obtained using ONA from an indnridual heterozygous for the 
variants of axon 8 of hMSH2. One extension reaction was carried out using Primer A (SEQ ID NO: 4) and dCTP, and a 
5 parallel extension reaction was carried out using Primer B (SEQ 10 NO: 5) and dATP. After the reactions were 
completed, they were pooled and their products analyzed by capillary electrophoresis as described above. One copy of 
the hMSH2 target sequence present in the sample had the G variant, such that the reaction containing Primer A and 
dCTP produced extension product PA1 from Primer A (PA). The other copy of the hMSH2 target sequence present in 
the sample had the T variant, such that the reaction containing Primer B and dATP produced the same extension 
10 products seen in Figure 4, namely Primer B (PB), PB1, PB2, and PBS. 

Example' 2 describes the use of the present invention with radioactively labeled primer to identify the 
variants of a PCR product. 

EXAMPLE 2 

The.following PCR product was used as a model system. The nucleotide indicated in bold type is located at 
15 the polymorphic site. In this sample, the A variant is present on the strand to be analyzed. 
5' GACGAATTCTAATACGACTCACTATAGGGTAAGGCCAAACGTTTAACT 3' . 
(SEQ ID NO: 8} 

5' AGTTAAACGTTTCGCCTTACCCTATAGTGAGTCGTATTAGAATTCGTC 3' 
(SEQ ID NO: 7) 

20 PRIMER: 5'TAATACGACTCACTATAGGG3'(SEQIDN0:8) 

Reaction conditions were as described in Example 1, except that the primer was radioactively labelled with • 
^ATP. After the reaction was complete, the reaction products were separated on a 20% polyacrylamide slab gel and 
products were detected by autoradiography of the gel. The lanes of the gel contained the following: Control Lane 
with labeled primer only; Lane A, reaction products with dATP added. Lane T, reaction products with dTTP added; 

25 Lane C, reaction products with dCTP added The samples in Lanes A (dATP added) and C (dCTP added) migrate the 
same distance as the 20'base primer control (Control Lane), , indicating that the primer was not extended in those 
reactions. However, the reaction products in Lane T (dTTP to reaction) shows an extension product. The primer was 
extended by addition of a single base (here, dTTP) to make a 21 -base product. The small amount of radioactivity seen 
in the 20*base position indicates that the reaction did not extend all of the radioactively labeled 20-base primer 

30 present. 
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WHAT IS CLAIMED IS: 

1. A method for determining the identity of the polymorphic nucleotide in a target sequence having at least 
two known variants, comprising: 

obtaining a sample comprising said target sequence; 
5 hybridizing a primer upstream of said polymorphic nucleotide; 

performing a first extension reaction with said hybridized primer in the absence of a 
deoxyribonucleoside triphosphate (dNTP) or ribonucleoside triphosphate (rNTP) complementary to said first 
known variant, but in the presence of at least one dNTP or rNTP wherein said at least one dNTP or rNTP 
includes a dNTP or rNTP complementary to said second known variant and wherein said at least one dNTP 
10 or rNTP is not detectably labeled or modified; 

performing a second extension reaction with said hybridized primer in the absence of a dNTP or 
rNTP complementary to said second known variant, but in the presence of at least one dNTP or rNTP, 
wherein said at least one dNTP or rNTP includes a dNTP or rNTP complementary to said first known variant 
and wherein said at least one dNTP or rNTP is not detectably labeled or modified; and 
15 analyzing the reaction products of said first extension reaction and said second extension reaction. 

2. The method of claim 1 wherein a plurality of dNTPs or rNTPs is included in said first and second 
extension reactions. 

3. The method of claim 1 wherein only one dNTP or rNTP is included in said first and second extension 

reactions. 

20 4. The method of claim 1 wherein said primer hybridizes such that its 3' end is immediately upstream of 

the polymorphic base. 

5. The method of clann 4 wherein one dNTP or rNTP is added. 

6. The method of daim 1 wherein the 5' end of said primer comprises a radioactive or non-radioactive tag. 

7. The method of claim 1, wherein said target sequence is amplified in vitro. 

25 8. The method of claim 1, wherein said step of analyzing the reaction products of said first extension 

reaction and said second extension reaction comprises determining the identity of the incorporated nucleotide which is 
complementary to said first known variant or said second known variant. 

9. The method of claim 1, wherein said step of analyzing the reaction products of said first extension 
reaction and said second extension reaction comprises determining the length of said reaction products. 

30 10. The method of claim 1, wherein said step of analyzing the reaction products of said first extension 

reaction and said second extension reaction comprises performing a technique selected from the group consisting of 
chromatography, capillary electrophoresis, microfluidic analysis, and slab gel electrophoresis. 

11. The method of claim 1, wherein the reaction products are detected using high performance liquid 
chromatography. 

35 1 2. The method of claim 1, wherein the reaction products are detected using capillary electrophoresis. 
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13. The method of claim 1, wherein the reaction products are detected using an intercalating agent. 

14. The method of claim 13, wherein said intercalating agent is ethidium bromide. 

15. The method of claim 13. wherein said intercalating agent is an unsymmetrical cyanine dye. 

16. The method of claim 1, wherein the reaction products are detected using slab electrophoresis and 
ultraviolet light. 

17. The method of claim 1, wherein the reaction products are detected using slab electiophoresis and a 
DNAbinding dye. 

18. The method of claim 1 wherein said target sequence having at least two known variants comprises a 
biallelic marker associated with genetic disorders. 

19. The method of claim 1, wherein said sample containing a target sequence having at least two known 
variants is from a diploid organism. 

20. The method of claim 1, wherein said first extension reaction is performed with a primer having a first 
length, and said second reaction is performed with a primer having a second length, said first and second lengths being 
selected such that said first primer and said second primer and any extension products thereof, can be distinguished 
from one another. . ... . ^- 

21. The method of claim 20, wherein the reaction products of said first and second extension reactions are 
analyzed separately. 

22. The method of claim 20, wherein the reaction products of said first and second extension reactions are 
pooled for analysis. 

23. A method for screening a DNA sample for a plurality of target sequences having at least two known 
variants, comprising: 

obtaining a sample comprising a plurality of know;< target sequences; 
hybridizing a primer upstream of each of said target sequences, each primer having a length such that said 
primer and any extension product thereof can be distinguished from the other primers and any extension 
products thereof; 

performing a plurality of extension reactions wherein each extension reaction contains a single free 
dNTP or rNTP species complementary to one polymorphic nucleotide of said variant, wherein said single free 
dNTP or rNTP species is not detectably labeled or modified; and 

analyzing the reaction products of each extension reaction. 

24. The method of claim 23, wherein said target sequences being analyzed are associated with genetic 

disorders. 

25. The method of claim 23, wherein said sample is from a diploid organism. 

26. The method of claim 23, wherein the products of the extension reactions are analyzed separately. 

27. The method of claim 23, wherein the products of the extension reactions are pooled for analysis. 
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28. A kit for use in determining the identity of the polymorphic nucleotide in a target sequence having at 
least two known variants, comprising: 

at least one primer that hybridizes to said target sequence such that its 3' end is upstream of said 
target sequence; 

5 a reagent for performing a primer extension reaction in the absence of a dNTP or rNTP 

complementary to said first known polymorphic nucleotide, but in the presence of at least one dNTP or rNTP 
wherein said at least one dNTP or rNTP includes a dNTP or rNTP complementary to said second known 
variant and wherein said at least one dNTP or rNTP is not detectably labeled or modified; and 

a reagent for performing a primer extension reaction in the absence of a dNTP or rNTP 
10 complementary to said second known polymorphic nucleotide, but in the presence of at least one dNTP or 

rNTP wherein said at least one dNTP or rNTP includes a dNTP or rNTP complementary to said first known 
variant and wherein said at least one dNTP or rNTP is not detectably labeled or modified. 

29. The kit of claim 28, further comprising a detection enhancer for said reaction products. 

30. The kit of claim 28, further comprising a purifier for said reaction products. 

15 31. The kit of claim 30, further comprising a detection enhancer for said reaction products. 

32. A kit for use in determining the identity of the polymorphic nucleotide in a target sequence having at 
least two known variants, comprising: 

at least one primer that hybridizes to said target sequence such that Its 3' end is immediately 
upstream of said target sequence; 
20 a reagent for perfonfning a primer extension reaction containing a single dNTP or rNTP 

complementary to said first known polymorphic nucleotide, wherein said single dNTP or rNTP is not 
detectably labeled or modified; and 

a reagent for performing a primer extension reaction containing a single dNTP or rNTP 
complementary to said second known polymorphic nucleotide, wherein said single dNTP or rNTP is not 
25 detectably labeled or modified. 

33. The kit of claim 32 further comprising a detection enhancer for said reaction products. 

34. The kit of claim 32 further comprising a purifier for said reaction products. 

35. The kit of claim 34, further comprising a detection enhancer for said reaction products. 

36. A method for determining the identity of the polymorphic nucleotide in a target sequence having at least 
30 two known variants, comprising perfonming a primer extension reaction in the absence of a dNTP or rNTP 

complementary to one of said polymorphic nucleotides but in the presence of at least one dNTP or rNTP 
complementary to the other polymorphic nucleotide, wherein said at least one dNTP or rNTP complementary to the 
other polymorphic nucleotide is not detectably labeled or modified, and detecting the reaction products of said 
extension reaction. 

35 
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SEQUENCE LISTING 



<110> 



TETRAGEN, S.A. 
Merezikova , Irena , N . 



<120> 



ANALYSIS ( 
AT A SITE 



OF NUCLEOTIDE POLYMORPHISMS 



<130> TETRAGN. 002VPC 

<150> US 09/471,703 
<151> 23-12-1999 

<160> 8 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 304 
<212> DNA 

<213> Homo sapiens 
<400> 1 

aaaattttat gatttgtatt ctgtaaaatg agatcttttt atttgtttgt ttt.actact:t 60 
tcttttagga aaacaccaga aattattgtt ggcagtitttt gtgacticctc ttactga^ct 120 
tcgttctgac ttctccaagt: ttcaggaaat gatagaaaca acttkagata tggatcaggt 180 
atgcaatata ctttttiaatt taagcagtag ttattettaa aaagcaaagg ccactttaag 240 
aaagtttgta gatttttttt tttagtatct aaatgtagca cctttgtgga cagtggatgt: 300 
aata 304 

<210> 2 
<211> 18 
<212> DNA 

<213> Artificial Secjuence 
<220> 

<223> Oligonucleotide 



<210> 3 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 3 

^aggaaacag ctatgacc 18 

<210> 4 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 



<400> 2 

tgtaaaacga cggccagt 



18 
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<400> 4 

acctgatcca tatct 15 



<210> 5 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 5 

catacctgat ccatatct 18 

<210> 6 
<211> 48 
<212> DNA 

<213> Homo sapiens 
<400> 6 

gacgaattct aatacgactc actatagggt aaggccaaac gtttaact 48 



<210> 7 
<211> 48 

<212> DNA . . , :: . 

<213> Homo sapiens 

<400> 7 

agttaaacgt ttcgccttac cctatagtga gtcgtattag aattcgtc 48 

<210> 8 
<211> 20 
<212> DNA 

<2X3> Artificial Sequence 

<220> 

<223> Oligonucleotide 
<400> 8 

taatacgact cactataggg 20 
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